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Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program ' 
employing a logical and intuitive user-interface to check whether a sequence listing is in x 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old ; 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3,0 replaces the previous DOS-based version of Checker, and is Y2K- , 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from the USPTO website at the following address: 



http://www.uspto.gov/web/offices/pac/checker 



Raw Sequence Listing Error Summary 



ERROR DETECTED 

ATTN: NEW RULES CASES: 

1 Wrapped Nucleics 

Wrapped Aminos 



8 



SUGGESTED CORRECTION SERIAL NUMBER: (W/XLh.YW 

PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

The number/text at the end of each line 'Sapped" down to the next line. This may occur if your file 
was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping. 1 * 



Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 



Misaligned Amino 
Numbering 

Non-ASCII 



Variable Length 



Patcntln 2.0 
" "bug" 



7 Skipped Sequences 

(OLD RULES) 



Skipped Sequences 
'(NEW RULES) 



The numbering under each 5 th amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text 

Sequenced) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue, Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patcntln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

scqucnces(s) Normally, Patcntln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

Sequencc(s) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please aJso adjust the "(ii) NUMBER OF SEQUENCES:" response to include' the skipped sequences. 

Sequences) missing. If intentional, please insert the following lines for each skipped sequence. 

<2 1 0> sequence id number 
<400> sequence id number 
000 



Use of n's or Xaa's Use of n's and/or Xaa's have been detected in the Sequence Listing. 

"(NEW RULES) Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 



10 Invalid <213> 

Response 



Per 1 823 of Sequence Rules, the only valid <21 3> responses arc: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence 



Use of<220> 



Sequencers 



12 



Patcntln 2.0 
' "bug" 



2 $ missing the <220> "Feature" and associated numeric identifiers and responses. 

UsTof^20> to <223> is MANDATORY if <2 1 3> "Organism" response is "Artificial Sequence" or 
"Unknown " Please explain source of genetic material in <220> to <223> section. 
(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 
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OIPE 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/866,570 



DATE : 06/15/2001 
TIME: 16:35:41 



Input Set : 
Output Set: 



A:\53679.APP.txt 
N:\CRF3\06152001\I866570.raw 




Does Not Comply 
Corrected Diskette Needed 

3 <110> APPLICANT: Croteau, Rodney et al. 

5 <120> TITLE OF INVENTION: Transacylases of the Paclitaxel Biosynthetic Pathway 

7 <130> FILE REFERENCE: 53679 

C--> 9 <140> CURRENT APPLICATION NUMBER: US/09/866, 570 

C--> 10 <141> CURRENT FILING DATE: 2001-05-25 

12 <150> PRIOR APPLICATION NUMBER: 09/411,145 

13 <151> PRIOR FILING DATE: 1999-09-30 
15 <160> NUMBER OF SEQ ID NOS : 58 

17 <170> SOFTWARE: Patentln Ver. 2.0 



ERRORED 


SEQUENCES 




























751 


<210> SEQ ID NO 


: 18 
























752 


<211> LENGTH: 302 
























753 


<212> TYPE: 


PRT 


























754 


<213> ORGANISM: 


Taxus cuspidata 


















756 


<400> SEQUENCE: 


18 
























757 


Phe 


Tyr 


Pro 


Phe 


Ala 


Gly 


Arg 


Met 


Arg 


Asn 


Lys 


Gly 


Asp 


Gly 


Glu 


Leu 


758 


1 








5 










10 










15 




760 


Glu 


Val 


Asp 


Cys 


Thr 


Gly 


Glu 


Gly 


Ala 


Leu 


Phe 


Val 


Glu 


Ala 


Met 


Ala 


761 








20 










25 










30 






763 


Asp 


Asp 


Asn 


Leu 


Ser 


Val 


Leu 


Gly 


Gly 


Phe 


Asp 


Tyr 


His 


Asn 


Pro 


Ala 


764 






35 










40 










45 








766 


Phe 


Gly 


Lys 


Leu 


Leu 


Tyr 


Ser 


Leu 


Pro 


Leu 


Asp 


Thr 


Pro 


He 


His 


Asp 


767 




50 










55 










60 










769 


Leu 


His 


Pro 


Leu 


Val 


Val 


Gin 


Val 


Thr 


Arg 


Phe 


Thr 


Cys 


Gly 


Gly 


Phe 


770 


65 










70 










75 










80 


772 


Val 


Val 


Gly 


Leu 


Ser 


Leu 


Asp 


His 


Thr 


He 


Cys 


Asp 


Gly 


Arg 


Gly 


Ala 


773 










85 










90 










95 




775 


Gly 


Gin 


Phe 


Leu 


Lys 


Ala 


Leu 


Ala 


Glu 


Met 


Ala 


Arg 


Gly 


Glu 


Ala 


Lys 


776 








100 










105 










110 






778 


Pro 


Ser 


Leu 


Glu 


Pro 


He 


Met 


Asn 


Arg 


Glu 


Leu 


Leu 


Lys 


Pro 


Glu 


Asp 


779 






115 










120 










125 








781 


Leu 


He 


Arg 


Leu 


Gin 


Phe 


Tyr 


His 


Phe 


Glu 


Ser 


Met 


Arg 


Pro 


Pro 


Pro 


782 




130 










135 










140 










784 


Xle 


Val 


Glu 


Glu 


Met 


Val 


Gin 


Ser 


Ser 


He 


He 


He 


Asn 


Ala 


Glu 


Thr 


785 


145 










-iS-O- 




















-AM- 


E--> 787 


lie 


Ser 


Asn^ 




Lys 


Gln 


Tyr 


He 


Met 


Glu 


Glu 


Cys 


Lys 


Glu 


Ser 


Cys 


788 










165 










170 










175 




790 


Ser 


Ala 


Phe 


Asp 


Val 


Val 


Gly 


Gly 


Leu 


Ala 


Met 


Leu 


Ala 


Arg 


Thr 


Lys 


791 








180 










185 










190 






793 


Ala 


Phe 


Gin 


He 


Pro 


His 


Thr 


Glu 


Asn 


Val 


Met 


Val 


He 


Phe 


Ala 


Val 


794 






195 










200 










205 








796 


Asp 


Ala 


Arg 


Arg 


Ser 


Phe 


Asp 


Pro 


Pro 


Leu 


Pro 


Lys 


Gly 


Tyr 


Tyr 


Gly 


797 




210 










215 










220 










799 


Asn 


Val 


Val 


Gly Asn 


Ala 


Cys 


Ala 


Leu 


Asp 


Asn 


Val 


Gin 


Asp 


Leu 


Leu 



file://C:\CRF3\Outhold\VsrI866570.htm 



6/15/01 



• ft 



Page 2 of 5 



RAW SEQUENCE LISTING DATE: 06/15/2001 

PATENT APPLICATION: US/09/866,570 TIME: 16:35:41 

Input Set : A:\53679.APP.txt 

Output Set: N:\CRF3\06152001\I866570.raw 

800 225 230 235 240 

802 Asn Gly Ser Leu Leu Arg Ala Thr Met He He Lys Lys Ser Lys Val 

803 245 250 255 

805 Ser Leu Lys Glu Asn He Arg Ala Lys Thr Leu Thr He Pro Ser He 

806 260 265 270 

808 Val Asp Val Asn Val Lys His Glu Asn He Val Gly Leu Gly Asp Leu 

809 275 280 285 

811 Arg Arg Leu Gly Phe Asn Glu Val Asp Phe Gly Trp Gly Lys 

812 290 295 300 

2185 <210> SEQ ID NO: 58 

2186 <211> LENGTH: 438 

2187 <212> TYPE: PRT 

2188 <213> ORGANISM: Taxus cuspidata 

2190 <400> SEQUENCE: 58 

2191 Met Glu Lys Leu His Val Asp He He Glu Arg Val Lys Val Ala Pro 

2192 15 10 15 

2194 Cys Leu Pro Ser Ser Lys Glu He Leu Gin Leu Ser Ser Leu Asp Asn 

2195 20 25 30 

2197 He Leu Arg Cys Tyr Val Ser Val Leu Phe Val Tyr Asp Arg Val Ser 

2198 35 40 45 

2200 Thr Val- Ser Ala Asn Pro Ala Lys Thr He Arg Glu Ala Leu Ser Lys 

2201 50 55 60 

2203 Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Asn Lys Glu 

2204 65 70 75 80 

2206 Asn Gly Asp Leu Glu Val Glu Cys Ser Gly Glu Gly Ala Val Phe Val 

2207 85 90 95 

2209 Glu Ala Met Ala Asp Asn Glu Leu Ser Val Leu Gin Asp Leu Asp Glu 

2210 100 105 HO 

2212 Tyr Cys Thr Ser Leu Lys Gin Leu He Phe Thr Val Pro Met Asp Thr 

2213 115 120 125 

2215 Lys He Glu Asp Leu His Leu Leu Ser Val Gin Val Thr Ser Phe Thr 

2216 130 135 140 

2218 Cys Gly Gly Phe Val Val Gly He Ser Phe Tyr His Thr He Cys Asp 

2219 145 150 155 160 

2221 Gly Lys Gly Leu Gly Gin Phe Leu Gin Gly Met Ser Glu He Ser Lys 

2222 165 170 175 

2224 Gly Ala Phe Lys Pro Ser Leu Glu Pro Val Trp Asn Arg Glu Met Val 

2225 180 185 190 

2227 Lys Pro Glu His Leu Met Phe Leu Gin Phe Asn Asn Phe Glu Phe Val 

2228 195 200 205 

2230 Pro His Pro Leu Lys Phe Lys Lys He Val Lys Ala Ser He Glu He 

2231 210 215 220 

2233 Asn Phe Glu Thr He Asn Cys Phe Lys Gin Cys Met Met Glu Glu Cys 

2234 225 230 235 240 

2236 Lys Glu Asn Phe Ser Thr Phe Glu He Val Ala Ala Leu He Trp Leu 

2237 245 250 255 

2239 Ala Lys Thr Lys Ser Phe Gin He Pro Asp Ser Glu Asn Val Lys Leu 

2240 260 265 270 

2242 Met Phe Ala Val Asp Met Arg Thr Ser Phe Asp Pro Pro Leu Pro Lys 
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RAW SEQUENCE LISTING DATE : 06/15/2001 

PATENT APPLICATION: US/09/866,570 TIME: 16:35:41 

Input Set : A:\53679.APP.txt 

Output Set: N:\CRF3\06152001\I866570.raw 



2243 






275 




- 






o o o 

280 










285 








2245 


Gly 


Tyr 


Tyr 


Gly 


Asn 


Val 


He 


Gly 


He 


Ala 


Gly 


Ala 


He 


Asp 


Asn 


Val 


2246 




290 










2 9 5 










O A A 

300 










2248 


Lys 


Glu 


Leu 


Leu 


Ser 


Gly 


Ser 


He 


Leu 


Arg 


Ala 


Leu 


He 


He 


He 


Gin 


2249 


*i r\ p 
305 










3 10 










315 










O O A 

320 


22 51 


Lys 


Thr 


lie 


Phe 


Ser 


T" ^ 

Leu 


Lys 


Asp 


Asn 


Phe 


He 


Ser 


Arg 


Arg 


Leu 


Met 


2252 










325 










330 










335 




2254 


Lys 


Pro 


Ser 


Thr 


Leu 


Asp 


Val 


Asn 


Met 


Lys 


TT J — 

His 


Glu 


Asn 


Val 


Val 


Leu 


22 55 








340 










34 5 










"J C A 

350 






2257 


Leu 


Gly Asp 


Trp 


Arg 


Asn 


Leu 


Gly 


Tyr 


Tyr 


Glu 


Ala 


Asp 


Cys 


Gly 


Cys 


2258 






355 










360 










365 








2260 


Gly 


Asn 


Leu 


Ser 


Asn 


Val 


He 


Pro 


Met 


Asp 


Gin 


Gin 


He 


Glu 


His 


Glu 


2261 




370 










375 










380 










2263 


Ser 


Pro 


Val 


Gin 


Ser 


Arg 


Phe 


Met 


Leu 


Leu 


Arg 


Ser 


Ser 


Lys 


Asn 


Met 


2264 


385 










390 










395 










400 


2266 


Gin 


Asn 


Gly 


He 


Lys 


He 


Leu 


Met 


Ser 


Met 


Pro 


Glu 


Ser 


Met 


Ala 


Lys 


2267 










405 










410 










415 




2269 


Pro 


Phe 


Lys 


Ser 


Glu 


Met 


Lys 


Phe 


Thr 


He 


Lys 


Lys 


Tyr 


Val 


Thr 


Gly 


2270 








420 










425 










430 






2272 


Ala 


Cys 


Phe 


Ser 


Glu 


Leu 






















2273 






435 




























E - > 2280 




































IP 

Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> to 
<223> fields of each sequence which presents at least one n or Xaa. 
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<210> 28 
<211> 439 
<212> PRT 



<213<Artificial Sequence^^) /^JLj /d^L#>- 



H(fY} & 




<400> 28 

Met Glu Lys Thr Asp Leu His Val Asn Leu He Glu Lys Val Met Val 





VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/866,570 



DATE: 06/15/2001 
TIME: 16:35:42 



Input Set : A:\53679.APP.txt 

Output Set: N:\CRF3\06152001\I866570.raw 



L:9 M:270 C: Current Application Number differs, Replaced Application Number 
L:10 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:787 M:340 E: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:18 



L: 


1233 


M: 


258 


W: 


Mandatory Feature 


missing , 


<220> FEATURE: 










L: 


1233 


M: 


258 


W: 


Mandatory Feature 


missing , 


<223> OTHER INFORMATION 






L: 


1398 


M: 


258 


W: 


Mandatory Feature 


missing , 


<221> not found 


for 


SEQ 


ID# 


: 34 


L: 


1398 


M: 


258 


W: 


Mandatory Feature 


missing , 


<222> not found 


for 


SEQ 


ID# 


: 34 


L: 


1398 


M: 


341 


W: 


(46) "n" or "Xaa" 


used, for 


■ SEQ ID# : 34 










L: 


1409 


M: 


258 


W: 


Mandatory Feature 


missing , 


<221> not found 


for 


SEQ 


ID# 


: 35 


L: 


1409 


M: 


258 


W: 


Mandatory Feature 


missing , 


<222> not found 


for 


SEQ 


ID# 


: 35 


L: 


1409 


M: 


341 


W: 


(46) "n" or "Xaa" 


used, for 


SEQ ID# : 35 










L: 


1420 


M: 


258 


W: 


Mandatory Feature 


missing , 


<221> not found 


for 


SEQ 


ID# 


: 36 


L: 


1420 


M: 


258 


W: 


Mandatory Feature 


missing , 


<222> not found 


for 


SEQ 


ID# 


: 36 


L: 


1420 


M: 


341 


W: 


(46) "n" or "Xaa" 


used, for 


SEQ ID# : 36 










L: 


1431 


M: 


258 


W: 


Mandatory Feature 


missing , 


<221> not found 


for 


SEQ 


ID# 


: 37 


L: 


1431 


M: 


258 


W: 


Mandatory Feature 


missing , 


<222> not found 


for 


SEQ 


ID# 


: 37 


L: 


1431 


M: 


341 


W: 


(46) "n" or "Xaa" 


used, for 


SEQ ID# : 37 










L: 


1442 


M: 


258 


W: 


Mandatory Feature 


missing , 


<221> not found 


for 


SEQ 


ID# 


: 38 


L: 


1442 


M: 


258 


W: 


Mandatory Feature 


missing, 


<222> not found 


for 


SEQ 


ID# 


: 38 


L: 


1442 


M: 


341 


W: 


(46) "n" or "Xaa" 


used, for 


SEQ ID# : 38 










L: 


1665 


M: 


341 


W: 


(46) "n" or "Xaa" 


used, for 


SEQ ID#:48 










L: 


2280 


M: 


332 


E: 


(32) Invalid/Missing Amino 


Acid Numbering, 


SEQ 


ID: 58 
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